STSM: Spatio-Temporal Shift Module for Efficient Action Recognition

نویسندگان

چکیده

The modeling, computational complexity, and accuracy of spatio-temporal models are the three major foci in field video action recognition. traditional 2D convolution has low but it cannot capture temporal relationships. Although 3D can obtain good performance, is with both high complexity a large number parameters. In this paper, we propose plug-and-play Spatio-Temporal Shift Module (STSM), which effective high-performance module. STSM be easily inserted into other networks to increase or enhance ability network learn features, effectively improving performance without increasing parameters complexity. particular, when CNNs integrated, new may features outperform based on convolutions. We revisit shift operation from perspective matrix algebra, i.e., sparse kernel. Furthermore, extensively evaluate proposed module Kinetics-400 Something-Something V2 datasets. experimental results show effectiveness STSM, recognition also achieve state-of-the-art two benchmarks.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatio-temporal SURF for Human Action Recognition

In this paper, we propose a new spatio-temporal descriptor called ST-SURF. The latter is based on a novel combination between the speed up robust feature and the optical flow. The Hessian detector is employed to find all interest points. To reduce the computation time, we propose a new methodology for video segmentation, in Frames Packets FPs, based on the interest points trajectory tracking. W...

متن کامل

Human Action Recognition Using Spatio-temporal Classification

In this paper a framework “Temporal-Vector Trajectory Learning” (TVTL) for human action recognition is proposed. In this framework, the major concept is that we would like to add the temporal information into the action recognition process. Base on this purpose, there are three kinds of temporal information, LTM, DTM, and TTM, being proposed. With the three kinds of proposed temporal informatio...

متن کامل

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR). The proposed STANNCR is based on action component and incorporates the spatial-temporal information. We first introduce a spatial-temporal distribution vector (STDV) to model the distributions of local feature locations in a compact and discri...

متن کامل

Improved Spatio-temporal Salient Feature Detection for Action Recognition

Spatio-temporal salient features localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. The robust detection of these features under geometric variations such as affine transformation and view/scale changes is however an open problem. Existing methods use the same filter for both time and space and hence, perform an ...

متن کامل

Genetic Programming-Evolved Spatio-Temporal Descriptor for Human Action Recognition

The potential value of human action recognition has led to it becoming one of the most active research subjects in computer vision. In this paper, we propose a novel method to automatically generate low-level spatio-temporal descriptors showing good performance, for high-level human-action recognition tasks. We address this as an optimization problem using genetic programming (GP), an evolution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10183290